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ABSTRACT 


Corners  are  very  useful  features  for  such  purposes  as 
image  matching  or  shape  analysis,  but  corner  detection  is  a 
relatively  expensive  opera cion.  This  paper  uses  filtered  x 
and  y  projections,  applied  to  an  image  containing  an  object 
that  has  not  been  explicitly  segmented  from  its  background, 
to  determine  possible  positions  of  corners,  so  that  corner 
detection  can  oe  applied  only  in  the  vicinity  of  these  posi¬ 
tions.  Even  in  cases  where  the  object  would  be  hard  to  seg¬ 
ment  (unimoaal  histogram) ,  this  approach  yields  a  good  set 
of  possible  corner  positions. 
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1.  Introduction 

Corner  points  in  an  image,  at  which  the  gradient  magnitude 
and  the  rate  of  change  of  gradient  direction  are  both  high,  are 
useful  features  for  image  matching,  because  they  yield  sharp 
matches.  The  corners  of  objects  that  have  been  extracted  from 
an  image  are  also  very  useful  in  shape  analysis,  e.g.,  as  ver¬ 
tex  positions  in  polygonal  approximation.  However,  corner 
points  in  an  image  are  relatively  expensive  to  detect  [1,2] , 
since  they  require  computing  higher-order  difference  operators 
in  every  position.  Corner  detection  is  much  less  expensive 
after  an  object  has  been  extracted  from  the  image;  but  it  may 
be  difficult  to  extract  the  object  cleanly  in  order  to  detect 
its  corners  if  the  image  is  noisy,  e.g.,  if  it  has  a  unimodal 
histogram. 

This  paper  describes  a  method  of  using  the  x  and  y  projections 
of  an  image  to  detect  possible  positions  in  the  image  where  cor¬ 
ners  may  be  present.  Corners  of  significant  size  should  give  rise 
to  slope  discontinuities  in  the  projections,  and  a  filtering  process 
(such  as  those  used  in  image  reconstruction)  can  be  used  to  pro¬ 
duce  peaks  at  these  discontinuities.  It  should  be  possible  to 
detect  discontinuities  due  to  corners  even  if  the  image  is  noisy. 


since  the  projection  process  involves  averaging,  which  o 


reduces 
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the  effects  of  noise. 


Each  peak  on  the  filtered  x(y)  projection  indic^t^s  ^column 
(row)  of  the  image  in  which  a  corner  may  b4  present,  so  that^ 
the  intersections  of  these  rows  and  column^  define  possible  cotner 


points.  Thus  corner  detection  need  be  applied  only  in  the 
vicinity  of  these  points,  which  are  only  a  small  fraction  of 
the  points  in  the  image.  To  further  reduce  the  computational 
cost,  the  neighborhoods  of  the  candidate  points  can  be  checked 
for  nonuniformity  of  gray  level  before  the  corner  detection 
process  is  applied  to  them.  Candidate  points  can  also  be  eli¬ 
minated  by  a  nonmaximum  suppression  process. 

Section  2  of  this  paper  describes  the  method  and  presents 
results  for  some  images  of  airplanes  and  an  image  of  a  tank. 
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Given  an  image  f(x,y),  its  x  and  y  projections  are  defined 


respectively  by 

p  (x)  =  Ef(x,y)  and  p(y)  =  Zf(x,y) 

y  x 

If  we  regard  the  image  as  the  sum  of  an  ideal  image  plus  zero- 
mean  random  noise,  the  projection  process  reduces  the  varia¬ 
bility  of  this  noise  (relative  to  the  signal  level  in  the  pro¬ 
jection)  .  This  may  be  stated  more  precisely  as  follows: 

Let  p(x)  =  E  f(x,y)  and  p(y)  *  E  f(x,y)  where  n  is  the  num 
n  n  ^ 

y  x 

ber  of  rows  and  nx  the  number  of  columns.  If  we  regard  the 
image  as  the  sum  of  an  ideal  image  f(x,y)  plus  zero  mean  random 
noise  z(x,y),  then  this  may  be  rewritten  as 


P  =  E  (f  (x,y)  +  z  (x, y) ) 
x  n 

y 

p  =  Z  (f  (x,y)  +  z  (x,y) ) 
nx 

In  order  to  compare  p  and  p  with  the  pixel  gray  level,  we  nor 

x  y 

malize  pv  and  p  : 
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Suppose  z(x,y)  is  a  random  variable  with  zero  mean  and  variance 


a  .  Then  the  second  term  can  be  regarded  as  a  sample  of  a 
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random  variable  with  zero  mean  and  variance  a  /n  (a  /n  ) . 
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Therefore,  the  projection  process  reduces  the  variability  of 
this  noise  [3]. 

Straight  edges  in  the  image  do  not  give  rise  to  discon¬ 
tinuities  in  the  projections  unless  they  are  nearly  parallel 
to  one  of  the  axes.  Corners  in  the  image,  on  the  other  hand, 
do  give  rise  to  slope  discontinuities.  Thus  corners  are 
generally  associated  with  slope  changes  in  the  projections, 
and  can  also  produce  discontinuities  if  one  side  of  the  corner 
lies  along  an  axis.  These  phenomena  are  illustrated  in 
Figure  1. 

To  enhance  these  (slope)  discontinuities  and  convert  them 
into  peaks,  we  can  apply  a  convolution  operation  to  the  pro¬ 
jections  which  acts  like  a  second-difference  operator,  thus 
converting  slope  changes  into  changes  of  magnitude.  A  simple 
example  of  such  an  operation  is  the  one-dimensional  digital 
Laplacian,  in  which  the  convolution  kernel  has  value  1  at  the 
origin  and  value  j  at  ±1.  Another  example  is  the  Shepp-Logan 
filter  [4],  in  which  the  kernel  has  value  approximately  0.2 
at  ±1.  Note  that  the  Laplacian  output  is  0  when  we  convolve 
it  with  a  linear  ramp,  whereas  the  Shepp-Logan  filter  yields 
nonzero  output  (we  use  a  truncated  form  of  this  filter) . 

Figure  2  shows  a  74x128  airplane  image  to  which  Gaussian 
noise  has  been  added  with  signal: noise  ratio=5:l*.  The  histo¬ 
grams  of  the  image  is  also  shown;  note  that  it  is  quite  unimodal. 

*s:n  is  defined  here  as  the  ratio  between  the  square  of  the  edge 
contrast  and  the  variance  of  the  noise;  see  [5]. 


Figure  3  shows  the  x  and  y  projections  of  the  image,  and  Figure  4 
shows  the  results  of  filtering  the  projections  of  the  image 
by  convolution  with  the  Shepp-Logan  and  Laplacian  filters, 
respectively  (a  constant  has  been  added  to  the  Laplacian  values) . 
Laplacian  filtering  was  used  in  our  experiments,  since  the 
Shepp-Logan  filter  preserves  the  original  values  to  a  great 
extent,  rather  than  simply  responding  to  changes. 

Any  peak  (“local  maximum)  on  the  filtered  projection  could 
indicate  the  presence  of  a  corner  in  the  image,  but  in  order  to 
eliminate  peaks  due  to  noise,  we  use  only  peaks  whose  heights 
are  in  the  top  30%  to  50%.  The  more  noisy  the  image,  the 
smaller  the  percentage  of  the  peaks  that  should  be  accepted. 


2.2  Comer  selection 

Each  peak  in  the  x(y)  projection  indicates  the  possible  pre¬ 
sence  of  a  corner  in  that  column  Crow)  of  the  image.  Thus  if 
we  have  m(n)  peaks  on  the  x(y)  projection,  we  have  mn  positions 
of  possible  corners  in  the  image,  i.e.,  the  positions  where  the 
m  columns  intersect  the  n  rows,  as  illustrated  in  Figure  5. 

For  the  airplane  image,  this  yields  150  positions,  as  seen  in 
Figure  6.  Note  that  in  Figure  5  the  four  corners  in  the  image 
give  rise  to  four  peaks  on  each  projection,  and  these  in  turn 
yield  sixteen  possible  corner  positions.  Similarly  in  Figure  6 
we  have  a  grid  of  possible  positions  for  the  corners.  Thus  we 
have  to  eliminate  most  of  the  possible  positions  by  checking 
them  for  the  presence  of  actual  corners. 

A  simple  method  of  eliminating  positions  at  which  comers 
are  not  present  is  to  test  for  uniformity  of  gray  level.  We 
use  a  3x3  window  centered  at  the  given  position;  if  the  gray 
level  range  r  in  this  window  is  less  than  a  specified  fraction 
of  the  average  gray  level  g  in  the  window,  we  call  the  window 
uniform.  To  reduce  the  effects  of  noise  on  the  measurement  of 


the  gray  level  range,  we  can  define  the  range  r  as  the  difference 
between  (e.g.)  the  averages  of  the  three  highest  and  three  lowest 


Sray  level*  in  the  window,  g3ma)t-g3min. 


Figure  7  shows  the 


results  of  eliminating  positions  for  which  r<pg,  where  p=.8,.7. 


and  .6.  The  results  are  shown  superimposed  on  the  non-noisy 
airplane  image.  For  this  image,  p=.7  seems  to  be  the  best  choice. 


When  we  use  p=.7,  the  number  of  candidate  corner  points  is 
reduced  to  62;  and  as  Figure  7  shows,  these  points  include 
many  of  the  major  corners  on  the  airplane.  Table  1  shows,  for 
each  candidate  point,  its  x  and  y  coordinates,  its  gradient 
magnitude  (G) ,  its  rate  of  change  of  gradient  direction  ("turn", 

T) ,  and  the  product  C=GT,  while  measures  "corner  merit". 

Figure  8  shows  a  histogram  of  the  values  of  C  using  p=.7. 

As  might  be  expected,  this  histogram  has  a  peak  at  low  values 
of  C,  and  if  we  threshold  C  to  eliminate  this  peak,  we  obtain 
a  selected  set  of  corner  points  which  still  includes  most  of 
the  airplane  corners.  Figure  9  shows  the  results  of  applying 
various  thresholds  to  C;  note  that  these  thresholds  eliminate 
most  of  the  candidate  points,  including  nearly  all  the  can¬ 
didates  that  lie  in  the  background. 

Another  way  to  eliminate  extraneous  corner  points  is  by 
nonmaximum  suppression.  Figure  10  shows  the  results  of  suppressing 
nonmaxima  of  C,  using  a  7x7  neighborhood  (i.e.,  a  point  is 
suppressed  if  there  is  a  point  within  checkerboard  distance  3 
with  a  higher  value  of  C) ,  after  thresholding  at  the  three  levels. 
Figure  11  shows  the  results  of  nonmaximum  suppression  after 
thresholding  at  500,  using  7x7,  11x11,  and  15x15  neighborhoods, 
corresponding  to  checkerboard  distances  3,  5,  and  7.  Another 
possibility  is  to  suppress  low  values  rowwise  or  columnwise, 
rather  than  in  a  small  neighborhood  -  e.g.,  to  discard  the  lowest 
value  of  C  in  each  row  (see  starred  entries  in  Table  1) .  Figure  12 


shows  the  results  of  doing  this  (without  thresholding)  and 
then  suppressing  nonmaxima  in  a  3x3,  7x7,  or  11x11  neighborhood. 

All  these  methods  of  picking  good  corner  points  give  us 
a  mixture  of  actual  airplane  corners  and  noise  comers.  In  a 
noisy  image,  it  is  not  possible  to  extract  the  correct  set  of 
corners  by  local  processing  alone,  even  with  the  aid  of  projec¬ 
tions,  since  the  noise  often  weakens  real  corners  while  pro¬ 
ducing  strong  false  corners.  However,  our  results  have  a  good 
intersection  with  the  set  of  correct  corners,  and  if  we  know  what 
shape  we  are  looking  for,  we  can  use  point  pattern  matching 
techniques  [6,7]  to  determine  that  a  subset  of  the  correct  cor¬ 
ners  is  present. 

Figure  13  shows  another  airplane  example:  (a)  airport 
scene;  (b)  airplane  window  and  its  projections;  (c)  possible 
corner  positions  and  result  of  selecting  nonuniform  positions; 

(d)  histogram  of  corner  merit  values  for  candidate  points; 

(e)  results  of  thresholding  and  of  nonmaximum  suppression; 

(f)  results  of  nonmaximum  suppression  without  thresholding.  As 
in  the  first  airplane  example,  good  sets  of  corner  points  are 
obtained  using  these  techniques.  Figure  14  shows  that  similar 
results  can  be  obtained  without  using  projections,  by  applying 
the  nonuni  fortuity  test  at  every  point  of  the  image  and  threshold¬ 
ing  or  nonmaximum-suppressing  the  corner  merits;  but  this  is 
considerably  more  expensive  (see  Section  3);  corner  merit  must  be 
computed  at  the  hundreds  of  points  that  satisfy  the  nonuniformity 
criterion. 


Figure  15  shows  another  example  using  a  noisy  infrared 
image  of  a  tank.  Here  the  nonuniformity  criterion  (p)  is 
smaller  because  the  image  has  lowe"  contrast.  Our  methods 
yield  less  than  20  corners,  most  of  which  lie  on  the  border 


of  the  tank. 


Our  method  compares  quite  favorably  in  computational  cost 

with  performing  corner  detection  at  every  pixel.  If  a  is 

the  fraction  of  rows  and  columns  for  which  peaks  are  detected 

on  the  projections,  then  the  fraction  of  pixels  that  are  can- 

.  .  2 

didate  comer  points  is  a  ;  e.g.,  a=25%  implies  that  only  about 
6%  of  the  pixels  are  candidates.  Moreover,  the  uniformity 
test,  which  tends  to  eliminate  a  majority  of  the  candidates,  is 
much  cheaper  than  the  corner  detection  operator;  the  latter 
involves  over  50  multiplications,  and  a  similar  number  of  addi¬ 
tions,  for  each  pixel.  The  overhead  of  computing  and  filtering 
the  projections  is  not  large;  computing  them  costs  about  2  addi¬ 
tions  per  pixel,  and  filtering  costs  on  the  order  of  the  square 
root  of  the  number  of  pixels. 

We  have  seen  that  corner  detection  in  a  noisy  image  is  not 
a  very  reliable  process.  Using  projections,  however,  in  which 
the  averaging  helps  to  combat  the  effects  of  the  noise,  we  can 
eliminate  most  of  the  non-corner  points  without  losing  more 
than  a  fraction  of  the  corner  points.  We  can  further  reduce 
the  set  of  candidate  points  by  applying  simple  tests  for  gray 
level  nonuniformity.  When  we  apply  corner  detection  to  the 
remaining  candidates,  and  use  thresholding,  nonmaximum  suppression 
or  both,  we  obtain  a  set  of  corners  comparable  to  the  set  that 
could  be  obtained  by  applying  corner  detection  in  the  image,  and 
at  much  lower  computational  cost. 
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Table  1. 

Corner 

merits  for 

the  candidate 

corner 

PC 

in  Figure  7  (middle:  p=.7).  X,Y  are  the  coordinates 
of  the  point,  T  is  the  rate  of  change  of  gradient 
direction,  G  is  the  gradient  magnitude,  and  C=GT. 
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Table  1,  cont'd 
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(b) 

Figure  4.  Result  of  filtering  the  projections  using 
(a)  the  truncated  Shepp-Logan  filter,  (b)  the  one¬ 
dimensional  digital  Laplacian. 


M 
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Figure  5.  If  there  are  m(n)  direction  changes  on 
the  projections,  there  are  mn  possible  corner  posi¬ 
tions  in  the  image.  In  this  simple  illustration, 
m=n=4 . 


Figure  6.  Possible  corner  positions  (165)  for  the  air 
plane  image. 


Figure  9.  Results  of  thresholding  C  at  500 
(left) ,  750  (middle) ,  and  1000  (right)  when  we 
used  p=.  7 . 


Figure  10.  Results  of  nonmaximum  suppression,  using 
a  7x7  neighborhood,  applied  to  Figure  9. 


I  <  Figure  11.  Results  of  nonmaximum  suppression  after 

thresholding  at  500,  using  neighborhoods  of  7x7  (left), 
r 11x11  (middle)  ,  and  15x15  (right)  . 


Figure  12.  Results  of  nonmaximum  suppression  after 
eliminating  the  weakest  corner  in  each  row  and  column 
(starred  entries  in  Table  1) ,  using  neighborhoods  of 
3x3  (left) ,  7x7  (middle) ,  and  llxll  (right) . 


Figure  13,  cont'd.  (c)  Candidate  corner  points,  a: 
results  of  applying  nonuniformity  test  r>pg  for  p 
. 8  ,  . 7  ,  and  . 6 . 

(d)  Histogram  of  corner  merit  values  for  p= . 6 . 


(f) 

Figure  13,  cont'd:  (e)  Left  column:  airplane  window 
and  candidate  points  using  p=.6  (same  as  rightmost  out- 
put  in  Figure  13c).  Bottom  row  (cols.  2-4):  results  of 
thresholding  the  corner  merit  values  at  250,  500  and  ,50. 
Top  row:  col.  2  same  as  bottom  row;  cols.  3-4,  results  of 
nonmaximum  suppression  using  5x5  and  7x7  neighborhoods. 

-7 /column:  candidate  points  using  p=. 6  (bottom)  and 
./(top)  (same  as  two  rightmost  outputs  in  Figure  13c).  Re¬ 
maining  columns:  results  of  nonmaximum  suppression  usinq 
3x3,  5x5,  and  7x7  neighborhoods. 


Figure  14.  Results  of  applying  the  nonuniformity 
test  at  every  goint  of  the  image  Top  row:  Points 
for  which  r>.7g  (left);  results  of  thresholding  C 
at  250,500,  and  700.  Bottom  row:  Results  of  non¬ 
maximum  suppression  without  thresholding.  Middle 
row:  Results  of  thresholding  at  500  and  nonmaximum 

suppression  using  3x3,  5x5,  and  7x7  neighborhoods. 


Figure  15(a):  Infrared  image  of  a  tank  and  its 
x  and  y  projections,  as  well  as  its  histogram. 
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